Chinese and Italian Speech Rhythm: Normalization and the CCI Algorithm
نویسندگان
چکیده
This paper re-examines the speech rhythm of Beijing Chinese and Pisa Italian by means of the Control/Compensation Index (CCI), with a view to normalizing the speech data, in order to reduce the effect of the rate factor. Two metrics were applied: (a) DnCCI, an adaptation to the CCI model of the nPVI normalization strategy; (b) SnCCI, a z-score normalization, which takes into account the actual constitution of each Vand Cinterval, by referring the individual segment’s duration to the mean duration of the members of the corresponding natural phoneme class. The results indicate the advantage of the SnCCI metrics as a normalization strategy.
منابع مشابه
Modeling the Speech Rhythm of Beijing Chinese in the CCI Framework
This study describes the application of CCI (Control/Compensation Index) [3, 4] to a corpus of spontaneous Beijing Chinese. CCI is a modification of the PVI algorithm [8], devised to provide an improved representation of the rhythmic tendencies of natural languages. The CCI algorithm was previously applied to the modeling of Italian [3, 5]. The present findings refer to Beijing Chinese.
متن کاملComputation of L2 Speech Rhythm Based on Duration and Fundamental Frequency
Rhythmic characteristics of speech vary between native and non-native speakers. Studies comparing the rhythmic properties of L1 and L2 speech based on rhythm metrics have shown that this relationship is far from straightforward. It seems evidently the case that the difference between native and non-native speech is a complex interaction of a variety of rhythmic cues (duration, F0 and intensity)...
متن کاملSpeech Enhancement by Modified Convex Combination of Fractional Adaptive Filtering
This paper presents new adaptive filtering techniques used in speech enhancement system. Adaptive filtering schemes are subjected to different trade-offs regarding their steady-state misadjustment, speed of convergence, and tracking performance. Fractional Least-Mean-Square (FLMS) is a new adaptive algorithm which has better performance than the conventional LMS algorithm. Normalization of LMS ...
متن کاملImproving the performance of MFCC for Persian robust speech recognition
The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but they are very sensitive to noise. In this paper to achieve a satisfactorily performance in Automatic Speech Recognition (ASR) applications we introduce a noise robust new set of MFCC vector estimated through following steps. First, spectral mean normalization is a pre-processing which applies to t...
متن کاملSpoken English assessment system for non-native speakers using acoustic and prosodic features
The absence of real-time and targeted feedback is often critical in spoken foreign language learning. Computer-assisted language assessment systems are playing an ever more important role in this domain. This work considers the idiosyncratic pronunciation patterns of Chinese English speakers and uses both acoustic and prosody features to capture pronunciation, word stress, and rhythm informatio...
متن کامل